Peptides having GDP exchange factor activity, nucleic acid sequences coding for these peptides, preparation and use

ABSTRACT

The present invention relates to peptides capable of modulating the levels of GDP exchange on p21-GDP complexes, the nucleic acid sequences coding for said peptides, preparation thereof and pharmaceutical compositions containing them.

The present invention relates to novel peptide and nucleotide sequences, and to their pharmaceutical use. More especially, the invention relates to peptides capable of modulating the levels of GDP exchange with p21-GDP complexes.

The products of ras genes, generally designated p21 proteins, perform a key role in the control of cell division in all eukaryotic organisms in which they have been investigated. Some specific modifications of these proteins cause them to lose their normal control and lead them to become oncogenic. Thus, a large number of human tumours have been associated with the presence of modified ras genes. Similarly, an overexpression of these p21 proteins can lead to a disturbance of cell proliferation. An understanding of the exact role of these p21 proteins in cells, their mode of functioning and their characteristics hence constitutes a major stake for an understanding of and therapeutic approach to carcinogenesis.

In vivo, the exact nature of the events responsible for the activation of p21 proteins is not yet known. It is known that they exert their functions by fluctuating between two confirmational states; an inactive form bound to GDP and an active form bound to GTP, but the factors effecting the transition between these two forms are not clearly identified. Recent studies report physiological situations during which the proportion of ras proteins bound to GTP increases in the cell. These situations comprise the activation of T lymphocytes and the stimulation of 3T3 fibroblasts by growth factors including EGF and PDGF (Downward et al., Nature 346 (1990) 719; Gibbs et al., J. Biol. Chem. 265 (1990) 20437). The increase in the proportion of p21-GTP may be at least partially explained by the action of a protein performing a role analogous to that of a receptor of the G proteins of transduction. In this connection, some proteins capable of promoting the exchange of GDP with p21 proteins have been identified, from ox brain [West et al., FEBS Lett. 259 (1990) 245] and rat brain [Wolfman and Macara, Science 248 (1990) 67]. The differing cellular localization of these under which they were obtained suggests that they are different proteins. They are as active on normal ras proteins as on those which are oncogenic. These activities are grouped together under the term GEF: Guanidine nucleotide Exchange Factor, or GRF.

In Saccharomyces cerevisiae yeast, GRF activity has been attributed to the product of the CDC25 gene [Camonis et al., EMBO J. 5 (1986) 375], and studies have been carried out in order to understand the signalling pathway involving the product of the CDC25, RAS1 and RAS2 genes on the one hand and adenylate cyclase on the other hand in Saccharomyces cerevisiae yeast. In particular, many studies have focused on characterization of the product of the CDC25 gene, which was the least known element of this chain. The product of the CDC25 gene constitutes the element furthest upstream in the cascade of reactions leading to the activation of p21 in yeast. The work carried out in this field has contributed to demonstrating that the product of this gene must act as a GDP→GTP exchange factor for activating ras proteins. A second gene of S. cerevisiae yeast, SDC25, structurally very closely related to CDC25, has been isolated and characterized. The active domain of SDC25 seems to be an exchange factor capable of acting in vitro and in vivo on ras proteins. This domain constitutes the first molecular constituent described which is endowed with this activity.

Very recently, a protein of the GRF type has also been demonstrated in mice [Vanoni and Martegani, J. Cell. Bioch. Suppl. 16B (1992) 220].

However, to date, no GRF activity has been isolated and characterized in man. The present invention is specifically the outcome of the demonstration by the Applicant of the existence of a human GDP exchange factor. The present invention is, more especially, the outcome of the identification, isolation and characterization of peptides and nucleotide sequences of human origin, designated hGRF and hSOS, capable of modulating the state of activation of p21 proteins.

A first aspect of the hence invention consists of peptides which can be used pharmaceutically. More especially, a subject of the invention lies in peptides capable of modulating the levels of exchange of GDP with p21-GDP complexes. It is understood that p21 denotes any expression product of a normal or oncogenic ras gene.

More especially, the peptides of the invention are chosen from all or part of the sequences SEQ ID no. 2, 3, 4, 6 or 8, or of a derivative of these sequences.

For the purposes of the present invention, the term derivative denotes any molecule obtained by modification, of a genetic and/or chemical nature, of these sequences and which retains the desired activity. Modification of a genetic and/or chemical nature should be understood to mean any mutation, substitution, deletion, addition and/or modification of one or more residues. Such derivatives may be generated for different purposes, such as, in particular, that of increasing the affinity of the peptide for its interaction site, that of improving its levels of production, that of increasing its resistance to proteases, that of increasing its therapeutic efficacy or of reducing its side effects, or that of endowing it with novel pharmacokinetic and/or biological properties.

In a particular embodiment of the invention, the peptides of the invention are peptides capable of stimulating the exchange of GDP with the p21-GDP complex.

In another particular embodiment of the invention, the peptides of the invention are peptides capable of slowing down or inhibiting the exchange of GDP with the p21-GDP complex. Such peptides are preferably peptides capable of antagonizing the interaction of GDP exchange factor with the p21-GDP complex. Hence they can be fragments of the sequences mentioned above or of derivatives of these sequences. Such fragments may be generated in different ways. In particular, they may be synthesized chemically on the basis of the sequences given in the present application, using the peptide synthesizers known to a person skilled in the art. They may also be synthesized genetically, by expression in a cell host of a nucleotide sequence coding for the desired peptide. In this case, the nucleotide sequence may be prepared chemically using an oligonucleotide synthesizer, on the basis of the peptide sequence given in the present application and the genetic code. The nucleotide sequence may also be prepared from the sequences given in the present application (SEQ ID no. 1, 5 and 7), by enzymatic cleavages, ligation, cloning, and the like, according to the techniques known to a person skilled in the art, or by the screening of DNA libraries with probes produced on the basis of these sequences. Moreover, the peptides of the invention, capable of slowing down or inhibiting the exchange of GDP with the p21-GDP complex, can also be peptides having a sequence corresponding to the site of interaction of the exchange factor with the p21-GDP complex.

Another subject of the invention lies in polyclonal or monoclonal antibodies or antibody fragments directed against a peptide as is defined above. Such antibodies may be generated by methods known to a person skilled in the art, in the light of the teachings given in the present application. In particular, these antibodies may be prepared by immunization of an animal against a peptide of the invention, withdrawal of blood and isolation of the antibodies. These antibodies may also be generated by the preparation of hybridomas according to the techniques known to a person skilled in the art.

More preferably, the antibodies or antibody fragments of the invention possess the capacity to inhibit at least partially the interaction of the exchange factor with the p21-GDP complex. They may thus be used to regulate the state of activation of the product of ras genes.

Moreover, these antibodies may also be used to detect and/or assay human GDP exchange factor in biological samples, and consequently to provide information about the state of activation of the product of ras genes.

The present invention hence makes it possible to generate peptides derived from the sequences SEQ ID no. 2-4, 6 and 8, as well as antibodies directed against these peptides, possessing advantageous biological properties with a view to pharmaceutical use. The biological activity of the different peptides and antibodies of the invention with respect to GDP exchange may be evaluated in different way, as illustrated in the examples.

The invention also provides compounds which are non-peptide or not exclusively peptide in nature and which can be used pharmaceutically. It is, in effect, possible, starting from the active proteinaceous units described in the present application, to produce molecules which inhibit the ras protein-dependant signalling pathway which are not exclusively peptide in nature and are compatible with pharmaceutical use. In this connection, the invention relates to the use of a polypeptide of the invention as is described above for the preparation of molecules which are non-peptide or not exclusively peptide in nature and are pharmacologically active on the levels of GDP exchange, by determination of the structural elements of this polypeptide which are important for its activity and reproduction of these elements by structures which are non-peptide or not exclusively peptide in nature. The subject of the invention is also pharmaceutical compositions comprising one or more molecules thus prepared.

The subject of the present invention is also any nucleic acid sequence coding for a polypeptide as is defined above. More preferably, the sequence in question is chosen from:

(a) all or part of the sequences SEQ ID no. 1, 5 or 7 or of their complementary strand,

(b) any sequence hybridizing with a sequence (a) and coding for a polypeptide according to the invention, and

(c) the sequences derived from the sequences (a) and (b) as a result of the degeneracy of the genetic code.

The different nucleotide sequences of the invention can be of artificial origin or otherwise. They can be genomic, cDNA, RNA sequences, hybrid sequences or synthetic or semi-synthetic sequences. These sequences may be obtained, for example, by the screening of DNA libraries (cDNA library, genomic DNA library) by means of probes produced on the basis of the sequences SEQ ID no. 1, 5 or 7. Such libraries may be prepared from cells of different origins by standard techniques of molecular biology known to a person skilled in the art. The nucleotide sequences of the invention may also be prepared by chemical synthesis, in particular according to the phosphoramidite method, or alternatively by mixed methods including the chemical or enzymatic modification of sequences obtained by the screening of libraries.

These nucleotide sequences according to the invention can be used in the pharmaceutical field, such use either being for the in vitro production of the peptides of the invention, or being for the production of anti-sense sequences or for the production of the peptides of the invention in the context of a gene therapy, or alternatively being for the detection and diagnosis, by hybridization experiments, of the expression or of an overexpression of an amplified, mutated or rearranged GDP exchange factor in biological samples, or for the isolation of homologous sequences from other cell sources.

For the production of the peptides of the invention, the nucleic acid sequences defined above are generally placed under the control of signals permitting their expression in a cell host. The choice of these signals (promoters, terminators, secretion leader sequence, and the like) can vary in accordance with the cell host used. Preferably, these nucleotide sequences of the invention form part of a vector, which can be autonomously replicating or integrative. More especially, autonomously replicating vectors may be prepared using sequences displaying autonomous replication in the chosen host. Integrative vectors, for their part, may be prepared, for example, using sequences homologous to certain regions of the host's genome, permitting integration of the vector by homologous recombination.

The cell hosts which can be used for the production of the peptides of the invention are equally well eukaryotic or prokaryotic hosts. Among eukaryotic hosts which are suitable, animal cells, yeasts or fungi may be mentioned. In particular, as regards yeasts, yeasts of the genus Saccharomyces, Kluyveromyces, Pichia, Schwanniomyces or Hansenula may be mentioned. As regards animal cells, COS, CHO, C127 cells, and the like, may be mentioned. Among fungi, Aspergillus ssp. or Trichoderma ssp. may be mentioned more especially. As prokaryotic hosts, it is preferable to use the following bacteria, E. coli, Bacillus or Streptomyces.

The nucleic acid sequences according to the invention may also be used for the production of anti-sense oligonucleotides or of genetic anti-sense sequences which can be used as pharmaceutical agents. Inhibition of the expression of certain oncogenes by anti-sense sequences has proved to be a useful strategy in understanding the role of these oncogenes, and an especially promising approach in the production of an anticancer treatment. Anti-sense sequences are small-sized oligonucleotides which is complementary to the coding strand of a given gene and which, as a result, are capable of hybridizing specifically with the transcribed mRNA, inhibiting its translation into protein. The subject of the invention is thus anti-sense sequences capable of at least partially inhibiting the production of peptides stimulating the exchange of GDP with p21-GDP complexes. Such sequences can consist of all or part of the nucleic acid sequences defined above. They are generally sequences or fragments of sequences complementary to sequences coding for peptides stimulating GDP exchange. Such oligonucleotides may be obtained from the sequences SEQ ID no. 1, 5 or 7, by fragmentation, and the like, or by chemical synthesis. Such sequences may be used in the context of gene therapies, for the transfer and in vivo expression of anti-sense sequences or of peptides capable of modulating the levels of exchanges of GDP with ras proteins. In this connection, the sequences may be incorporated in vectors, in particular of viral origin.

The invention also relates, as nucleotide sequences, to nucleotide probes, synthetic or otherwise, capable of hybridizing with the nucleotide sequences defined above which code for a peptide of the invention, or with the corresponding mRNA. Such probes may be used in vitro as a diagnostic tool, for detection of the expression of GDP exchange factor, or alternatively for the demonstration of genetic abnormalities (incorrect splicing, polymorphism, point mutations, and the like). Such probes must be labelled beforehand, and different techniques for this purpose are known to a person skilled in the art. The hybridization conditions under which these probes can be used are the normal conditions of stringency (see, in particular, the general cloning techniques below as well as the examples). These probes may also be used for the demonstration and isolation of homologous nucleic acid sequences coding for a peptide of the invention, from other cell sources, as illustrated in the examples.

The subject of the invention is also any pharmaceutical composition comprising as active principle at least one peptide as is defined above.

Its subject is also any pharmaceutical composition comprising as active principle at least one antibody and/or antibody fragment as is defined above, as well as any pharmaceutical composition comprising as active principle at least one nucleotide sequence as is defined above.

Moreover, its subject is also pharmaceutical compositions in which the peptides, antibodies and nucleotide sequence defined above are combined with one another or with other active principles.

The pharmaceutical compositions according to the invention may be used to modulate the activation of p21 proteins and, as a result, to modulate the proliferation of certain cell types. More especially, these pharmaceutical compositions are intended for the treatment of cancers. Many cancers have, in effect, been associated with the presence of oncogenic ras proteins. Among cancers most often containing mutated ras genes, there may be mentioned, in particular, adenocarcinomas of the pancreas, 90% of which have a Ki-ras oncogene mutated on the twelfth codon [Almoguera et al., Cell 53 (1988) 549], adenocarcinomas of the colon and cancers of the thyroid (50%), or carcinomas of the lung and myeloid leukaemias [30%, Bos, J. L. Cancer Res. 49 (1989) 4682].

The subject of the invention is also the use of the molecules described above for modulating the activity of p21 proteins. In particular, the invention relates to the use of these molecules for at least partially inhibiting the activation of p21 proteins.

The invention also provides a method for detection of the expression and/or of an overexpression of a ras gene in a biological sample. Such a method comprises, for example, bringing such a sample into contact with an antibody or antibody fragment according to the invention, visualization of the antigen-antibody complexes and comparison of the results obtained with a standard sample. In such a method, the antibody may be in suspension or immobilized beforehand on a support. This method can also comprise bringing the sample into contact with a nucleotide probe according to the invention, demonstration of hybrids obtained and comparison with those obtained in the case of a standard sample.

The present invention may he used in the therapeutic field: since the peptides, antibodies and nucleotide sequences of the invention are capable of modulating the activity of ras genes, they make it possible, in effect, to intervene in the process of development of cancers. As illustrated in the examples, the nucleotide sequences of the invention make it possible, in particular, to express peptides capable of complementing the temperature sensitivity of yeasts carrying a cdc25 mutation. They also make it possible to express peptides capable of suppressing a RAS2ts dominant mutation, demonstrating a competition with the normal expression product of the CDC25 gene for interaction with the p21 proteins. The invention may also be used in the field of diagnosis and typing of cancers: the antibodies and nucleotide probes of the invention make possible, in effect, the identification of cancers in which a ras gene is involved, as well as the diagnosis of cancers associated with the overexpression of a normal or oncogenic ras gene.

Other advantages of the present invention will become apparent on reading the examples which follow, which are to be considered as illustrative and non-limiting.

LEGEND TO THE FIGURES

FIG. 1: Trans-activation test: This figure shows as ordinates the levels of CAT activity (% relative to background) of recombinant CHO strains, according to the cDNA sequence used for the transformation.

FIG. 2: Test of GTP exchange activity: This figure shows as ordinates the ratio of the p21-GDP/p21-GTP forms of recombinant CHO strains, according to the cDNA sequence used for the transformation.

FIG. 3: Test of GDP exchange activity in vitro: This figure shows, in terms of time and for 2 concentrations of a peptide of the invention, the decrease in the proportion of GDP remaining bound to the p21 protein.

GENERAL CLONING TECHNIQUES

The methods traditionally used in molecular biology, such as preparative extractions of plasmid DNA, centrifugation of plasmid DNA in a caesium chloride gradient, agarose or acrylamide gel electrophoresis, purification of DNA fragments by electroelution, protein extractions with phenol or phenol/chloroform, ethanol or isopropanol precipitation of DNA in a saline medium, transformation in Escherichia coli, and the like, are well known to a person skilled in the art and are amply described in the literature [Maniatis T. et al., "Molecular Cloning, a Laboratory Manual", Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 1982; Ausubel F. M. et al. (eds), "Current Protocols in Molecular Biology", John Wiley and Sons, New York 1987].

Restriction enzymes were supplied by New England Biolabs (Biolabs), Bethesda Research Laboratories (BRL) or, Amersham, and are used according to the suppliers' recommendations.

Plasmids of the pBR322, pUC, λgt11 and pGEX 2T type and phages of the M13 series are of commercial origin.

For ligation, the DNA fragments are separated according to size by agarose or acrylamide gel electrophoresis, extracted with phenol or with a phenol/chloroform mixture, precipitated with ethanol and then incubated in the presence of phage T4 DNA ligase (Biolabs) according to the supplier's recommendations.

The filling of 5' protruding ends is performed with the Klenow fragment of E. coli DNA polymerase I (Biolabs) according to the supplier's specifications. The destruction of 3' protruding ends is performed in the presence of phage T4 DNA polymerase (Biolabs) used according to the manufacturer's recommendations. The destruction of 5' protruding ends is performed by a controlled treatment with S1 nuclease.

Mutagenesis directed in vitro by synthetic oligodeoxynucleotides is performed according to the method developed by Taylor et al. [Nucleic Acids Res.13 (1985) 8749-8764] using the kit distributed by Amersham.

The enzymatic amplification of DNA fragments by the so-called PCR [Polymerase-catalyzed Chain Reaction, Saiki R. K. et al., Science 230 (1985) 1350-1354; Mullis K. B. and Faloona F. A., Meth. Enzym. 155 (1987) 335-350] technique is performed using a "DNA thermal cycler" (Perkin Elmer Cetus) according to the manufacturer's specifications.

Verification of the nucleotide sequences is performed by the method developed by Sanger et al. [Proc. Natl. Acad. Sci. USA, 74 (1977) 5463-5467] using the kit distributed by Amersham.

For the hybridization experiments, the normal conditions of stringency are generally as follows: hybridization: 3× SCC in the presence of 5× Denhart's at 65° C.; washing: 0.5× SSC at 65° C.

1. Isolation of the human GDP exchange factor (hGRF) gene

5×10⁵ phages of a human brain library constructed in the vector λgt11 [Skolnik et al., Cell 65 (1991) 83] were screened according to the techniques described by Sambrook, Fritsch and Maniatis (Molecular cloning; Cold Spring Harbor Laboratory Press; 1989).

The probe used for the screening of this library is a ³² P-labelled 137-base pair human cDNA fragment. This probe was prepared by PCR on the total DNA of the abovementioned library using the following degenerate oligonucleotides as primers:

    __________________________________________________________________________     -ATC                                                                               CGT                                                                               CAG GTA                                                                               CAT                                                                               CCC                                                                               CAG GTA                                                                               TGG CAC                                                                               ACA                                              T  T         T  T      T   T                                                   A  A         A  A          G  A                                                   G         G             G                                               __________________________________________________________________________

for the oligonucleotide of the 3' end of the fragment (SEQ ID no. 11), and

    ______________________________________                                         -GCA  ATT    TTT    CGG  CTT   AAG  AAG   ACT  TGG                             T     C      C      A    C     A    A     C                                    C     A             T    G                A                                    G                   C    A                G                                    ______________________________________                                    

for the oligonucleotide of the 5' end of the fragment (SEQ ID no. 12).

The probe was labelled with ³² P by random priming according to the technique of Feinberg and Vogelstein [Anal. Biochem. 137 (1984) 266], and the PCR reaction was conducted at approximately 40° C. under the conditions described in the general cloning techniques.

Among the different positive clones obtained by hybridization with this probe, one comprises the whole of an open reading frame which carries the exchange activity with respect to Ha-Ras.

This λgt11 clone contains a 3-kb cDNA which has been introduced in the form of an EcoRI fragment at the corresponding site of an M13mp18 vector. This cDNA was then sequenced using commercial "reverse" and "-20" primers and also using specific oligonucleotides according to the Sanger technique (see general cloning techniques).

The nucleotide sequence of the human GDP exchange factor gene carried by the fragment thereby obtained is presented in the sequence SEQ ID no. 1, together with the deduced peptide sequences (SEQ ID no. 2, 3 and 4).

2. Preparation of subfragments

Different derivatives or fragments of the gene thereby obtained may be prepared and used, in particular for the expression of peptides of the invention. In particular, the following fragments were prepared by enzymatic cleavage, and separated by electroelution:

a PstI-EcoRI fragment comprising a portion of the exchange factor coding region (936 up to the stop codon) as well as the 3' non-coding region of the fragment,

an EcoNI-EcoRI fragment comprising a portion of the exchange factor coding region (910 up to the stop codon) as well as the 3' non-coding region of the fragment,

an EagI-EcoRI fragment comprising a portion of the exchange factor coding region (up to the stop codon) as well as the 3' non-coding region of the fragment,

a BalI-EcoRI fragment comprising a portion of the exchange factor coding region (up to the stop codon) as well as the 3' non-coding region of the fragment, and

an NaeI-EcoRI fragment comprising a portion of the exchange factor coding region (up to the stop codon) as well as the 3' non-coding region of the fragment,

It is understood that the 3' non-coding region carried by these fragments is of secondary importance and that it may be removed, either by digestion by means of a nuclease, or by cleavage with an enzyme having a site close to the stop codon, such as, in particular, SmaI, the site for which is localized approximately 30 bp after the stop codon.

It is also understood that other fragments may be prepared, such as, in particular, fragments not containing the region coding for the entire C-terminal portion, as well as derivatives of these fragments, obtained by mutation, substitution, addition or modification of a chemical and/or genetic nature.

3. Biological characterization

The functionalities of the peptides according to the invention were tested:

in mammalian cells,

in Saccharomyces cerevisiae yeast, or alternatively

in vitro on the recombinant Ha-Ras protein.

3.1. For the functional evaluation in mammalian cells, the DNA sequences coding for peptides of the invention, such as for example, those described in Example 2, may be placed under the control of the SV40 early promoter in the vector pCym1 described by Camonis et al. [Gene 86 (1990) 263].

In this example, the PstI-EcoRI and EcoNI-EcoRI fragments described in Example 2 were inserted into this vector.

The vectors thereby obtained were tested by transient transfection into CHO cells according to the protocol described by Rey et al. [Oncogene 6 (1991) 347].

Two criteria of functionality were studied in this way:

the capacity of the vectors to trans-activate a promoter governing the expression of a reporter gene, which is, in this instance, the bacterial gene coding for chloramphenicol acetyltransferase (CAT gene),

their capacity to promote the loading with GTP of the Ras protein of transfected CHO cells.

a) For the trans-activation tests, CHO cells at 50% confluence were transfected (see, for example, the protocol described by Schweighoffer et al., Science, In Press), on the one hand with 0.5 μg of a vector carrying the CAT gene under the control of a synthetic promoter composed of the murine promoter of the thymidine kinase gene and 4 PEA1 repeat elements derived from the polyoma enhancer [Wasylyk et al., EMBO J. 7 (1988) 2475), and on the other hand with 4.5 μg of an expression vector carrying, under the control of the SV40 early promoter, no coding cDNA (lane 1), cDNA of normal Ha-Ras (lane 2), cDNA of Ha-Ras activated at position 12 (Val 12) (lane 3), the 3' end of SDC25 cDNA, described by Rey et al. cited above (lane 4), and cDNA coding for a peptide according to the invention, described above (lane 5).

The results are presented in FIG. 1. With lane 1 corresponding to the baseline activation, lane 5 (obtained for the PstI-EcoRI fragment: CDC hum.) shows that the expression of a peptide of the invention permits trans-activation of the synthetic promoter studied.

The same qualitative result was obtained for the other fragments studied.

b) So as to verify that this trans-activation does indeed involve a loading of the ras proteins of CHO cells with nucleotides, the same transient transfections were carried out with ³² P-labelled orthophosphate being added to the culture medium.

This labelling protocol, as well as that for the immunoprecipitation of cellular ras proteins, is described by Rey et al. cited above. The results obtained are presented in FIG. 2. They show that the peptides of the invention are capable of modulating the levels of exchange of GDP with ras proteins, since some of them (peptide CDC Hum. expressed by the PstI-EcoRI fragment described above) are capable of promoting the loading with GTP of the ras proteins of CHO cells immunoprecipitated with Y 13-259 antibody.

3.2. The peptides of the invention were also tested functionally in S. cerevisiae cdc25⁻ yeast. To this end, the vectors described above (3.1.), which are shuttle vectors, were introduced into the yeast strain OL97-1-11B [Camonis and Jacquet, Mol. Cell. Biol. 8 (1988) 2980]. The results obtained show that the cDNA fragments according to the invention code for peptides which are capable of complementing the lack of growth of this strain at 36° C. These results thus show that these fragments code for peptides which are functional in vivo in S. cerevisiae.

3.3. The capacity of the DNA sequences of the invention to code for peptides capable of promoting the exchange of GDP with purified Ha-Ras proteins was also demonstrated in vitro according to the protocol described by Rey et al. Mol. Cell. Biol. 9 (1989) 3904].

To this end, the sequences of the invention are expressed in E. coli strain TG1 in the form of fusion proteins with glutathione S-transferase (GST) according to the technique described by Smith and Johnson [Gene 67 (1988) 31]. To this end, the different DNA fragments described in 3.1. above were cloned, in the form of SmaI-EcoRI fragments, into the vector pGEX 2T (Pharmacia), at the 3' end of, and in frame with, a cDNA coding for GST. The SmaI-EcoRI fragments are obtained by adding an adapter by means of a ligase. The vectors thereby obtained are then used to transform E. coli strain TG1. The cells thus transformed are precultured overnight at 37° C., diluted to 1/10 in LB medium, added of IPTG to induce expression (2 hours, 25° C.) and then cultured for approximately 21 hours at 25° C. The cells are then lysed, and the fusion proteins produced are purified by affinity on an agarose-GSH column. To this end, the bacterial lysate is incubated in the presence of the gel (prepared and equilibrated with the lysis buffer) for 15 minutes at 4° C. After three washes with a Tris-HCl buffer, pH 7.4, the proteins are eluted in the presence of a Tris-HCl buffer, pH 7.7, containing an excess of GSH. The supernatant is harvested and centrifuged.

The GDP exchange activity of the peptides of the invention with respect to purified Ha-Ras proteins was then demonstrated in vitro according to the protocol described by Rey et al. (Mol. Cell. Biol. cited above). The results obtained are presented in FIG. 3. They show, in particular, that the peptide of the invention corresponding to the CDC hum sequence, expressed by the PstI-EcoRI fragment described above, stimulates GDP exchange.

4. Demonstration of homologous sequences

Nucleic acid sequences homologous to that presented in FIG. 1 were demonstrated by two different strategies:

by PCR, under the conditions described in the general cloning techniques, on cDNAs neosynthesized from placental mRNA, using as primers degenerate oligonucleotides chosen to cover sequences which are conserved between the sequence SEQ ID no. 1 and the sequence of the SOS protein [Bonfini et al., Science 255 (1992) 603].

By screening a placental cDNA library using a probe consisting of the whole of the sequence SEQ ID no. 1 labelled with ³² P. This screening was carried out under conditions of low stringency: hybridization at 50° C. in 5× SSC, 5× Denhart's medium; followed by washing at 50° C. in 2× SSC medium.

These two strategies enabled sequences homologous to the sequence SEQ ID no. 1 to be revealed. These sequences may be readily isolated and characterized. They constitute nucleic acid sequences for the purposes of the present invention when they code (or their fragments or derivatives) for peptides capable of modulating the levels of exchanges of GDP with p21-GDP complexes.

In particular, this strategy made it possible to identify, from placental mRNA and using the oligonucleotides oligo 2449 (SEQ ID no. 9) and oligo 2451 (SEQ ID no. 10), 2 cDNAs coding for factors designated hSOS1 and hSOS2, the partial sequences of which are shown in SEQ ID no. 5 and SEQ ID no. 7, respectively. The mRNAs corresponding to these factors are present in all tissues in which they have been looked for, in contrast to the factor described in Example 1 which appears to be localized only in the brain. The chromosomal localization of these genes was performed, and gave the following results:

h-GRF:15q2.4.

h-SOS1:4q2.1.

h-SOS2:14q2.2.

5. Testing for exchange factors of the h-GRF type in other tissues

An anti-h-GRF antibody was prepared in rabbits by immunization with an antigen corresponding to the fragment of 280 amino acids localized between residues 211 and 489 of the h-GRF factor presented in SEQ ID no. 4.

These antibodies enabled proteins of apparent molecular weights 30, 55, 75, 95 and 140 kDa to be detected, by ELISA, in human cortex and brain precursor cells. The diversity of molecular weights suggests the presence of multiple cDNAs. Preincubation of the anti-h-GRF antibody with h-GRF abolishes the detection of the proteins identified, thereby demonstrating the specificity of the signal.

    __________________________________________________________________________     SEQUENCE LISTING                                                               (1) GENERAL INFORMATION:                                                       (iii) NUMBER OF SEQUENCES: 12                                                  (2) INFORMATION FOR SEQ ID NO:1:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 2652 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (iii) HYPOTHETICAL: NO                                                         (iii) ANTI-SENSE: NO                                                           (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 1..2445                                                          (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 445..2445 (SEQ ID NO 3)                                          (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 976..2445 (SEQ ID NO 4)                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:                                       GGCGATGGCTGTAAGATCCTCCTGGACACCAGCCAGACCTTTGTGAGA48                             GlyAspGlyCysLysIleLeuLeuAspThrSerGlnThrPheValArg                               151015                                                                         CAAGGTTCCCTCATTCAGGTGCCCATGTCTGAAAAGGGCAAGATCACC96                             GlnGlySerLeuIleGlnValProMetSerGluLysGlyLysIleThr                               202530                                                                         AGGGGGCGCCTGGGGTCTCTCTCCCTAAAGAAAGAGGGCGAGCGACAG144                            ArgGlyArgLeuGlySerLeuSerLeuLysLysGluGlyGluArgGln                               354045                                                                         TGCTTCCTGTTTTCTAAGCATCTGATTATCTGTACCAGAGGCTCTGGA192                            CysPheLeuPheSerLysHisLeuIleIleCysThrArgGlySerGly                               505560                                                                         GGGAAGCTTCACTTGACCAAGAATGGAGTCATATCCCTCATTGACTGC240                            GlyLysLeuHisLeuThrLysAsnGlyValIleSerLeuIleAspCys                               65707580                                                                       ACTTTATTGGAGGAGCCAGAAAGCACGGAGGAGGAAGCCAAAGGATCC288                            ThrLeuLeuGluGluProGluSerThrGluGluGluAlaLysGlySer                               859095                                                                         GGCCAAGACATAGATCACTTGGATTTTAAAATCGGGGTGGAGCCAAAG336                            GlyGlnAspIleAspHisLeuAspPheLysIleGlyValGluProLys                               100105110                                                                      GATTCCCCGCCCTTTACAGTCATCCTAGTGGCCTCGTCCAGACAGGAG384                            AspSerProProPheThrValIleLeuValAlaSerSerArgGlnGlu                               115120125                                                                      AAGGCAGCGTGGACCAGTGACATCAGCCAGTGTGTTGGTAACATCCGA432                            LysAlaAlaTrpThrSerAspIleSerGlnCysValGlyAsnIleArg                               130135140                                                                      TGCAATGGGCTCATGATGAAGCCATTTGAAGAAAATTCCAAGGTCACT480                            CysAsnGlyLeuMetMetLysProPheGluGluAsnSerLysValThr                               145150155160                                                                   GTGCCGCAGATGATCAAGTCCGACGCCTCCTTATATTGTGATGATGTT528                            ValProGlnMetIleLysSerAspAlaSerLeuTyrCysAspAspVal                               165170175                                                                      GACATTCGCTTCAGCAAAACCATGAACTCCTGCAAAGTGCTGCAGATC576                            AspIleArgPheSerLysThrMetAsnSerCysLysValLeuGlnIle                               180185190                                                                      GCCTACGCCAGTGTGGAGCGGCTGCTGGAGAGGCTGACGGACCTGCGC624                            AlaTyrAlaSerValGluArgLeuLeuGluArgLeuThrAspLeuArg                               195200205                                                                      TTCCTGAGCATCGACTTCCTCAACACCTTCCTGCACTCCTACCGCGTC672                            PheLeuSerIleAspPheLeuAsnThrPheLeuHisSerTyrArgVal                               210215220                                                                      TTCACCACCGCCATCGTGGTCCTGGACAAGCTCATTACCATCTACAAG720                            PheThrThrAlaIleValValLeuAspLysLeuIleThrIleTyrLys                               225230235240                                                                   AAGCCTATCAGTGCCATTCCTGCCAGGTCGCTGGAGCTCCTGTTTGCC768                            LysProIleSerAlaIleProAlaArgSerLeuGluLeuLeuPheAla                               245250255                                                                      AGTGGCCAGAACAATAAGCTCCTGTACGGTGAACCCCCCAAGTCCCCG816                            SerGlyGlnAsnAsnLysLeuLeuTyrGlyGluProProLysSerPro                               260265270                                                                      CGCGCCACCCGCAAGTTCTCCTCGCCGCCACCTCTGTCCATCACCAAG864                            ArgAlaThrArgLysPheSerSerProProProLeuSerIleThrLys                               275280285                                                                      ACATCGTCACCGAGCCGCCGGCGGAAGCTCTCCCTGAACATCCCCATC912                            ThrSerSerProSerArgArgArgLysLeuSerLeuAsnIleProIle                               290295300                                                                      ATCACTGGCGGCAAGGCCCTGGACCTGGCCGCCCTCAGCTGCAACTCC960                            IleThrGlyGlyLysAlaLeuAspLeuAlaAlaLeuSerCysAsnSer                               305310315320                                                                   AATGGCTACACCAGCATGTACTCGGCCATGTCACCCTTCAGCAAGGCC1008                           AsnGlyTyrThrSerMetTyrSerAlaMetSerProPheSerLysAla                               325330335                                                                      ACGCTGGACACCAGCAAGCTCTATGTGTCCAGCAGCTTCACCAACAAG1056                           ThrLeuAspThrSerLysLeuTyrValSerSerSerPheThrAsnLys                               340345350                                                                      ATTCCAGATGAGGGCGATACGACCCCTGAGAAGCCCGAAGACCCTTCA1104                           IleProAspGluGlyAspThrThrProGluLysProGluAspProSer                               355360365                                                                      GCGCTCAGCAAGCAGAGCTCAGAAGTCTCCATGAGAGAGGAGTCAGAT1152                           AlaLeuSerLysGlnSerSerGluValSerMetArgGluGluSerAsp                               370375380                                                                      ATTGATCAAAACCAGAGTGATGATGGTGATACTGAAACATCACCAACT1200                           IleAspGlnAsnGlnSerAspAspGlyAspThrGluThrSerProThr                               385390395400                                                                   AAATCTCCAACAACACCCAAATCAGTCAAAAACAAAAATTCTTCAGAG1248                           LysSerProThrThrProLysSerValLysAsnLysAsnSerSerGlu                               405410415                                                                      TTCCCACTCTTTTCCTATAACAATGGAGTCGTCATGACCTCCTGTCGT1296                           PheProLeuPheSerTyrAsnAsnGlyValValMetThrSerCysArg                               420425430                                                                      GAACTGGACAATAACCGCAGTGCCTTGTCGGCCGCCTCTGCCTTTGCC1344                           GluLeuAspAsnAsnArgSerAlaLeuSerAlaAlaSerAlaPheAla                               435440445                                                                      ATAGCAACCGCCGGGGCCAACGAGGGCACCCCAAACAAGGAGAAGTAC1392                           IleAlaThrAlaGlyAlaAsnGluGlyThrProAsnLysGluLysTyr                               450455460                                                                      CGGAGGATGTCCTTAGCCAGTGCAGGGTTTCCCCCAGACCAGAGGAAT1440                           ArgArgMetSerLeuAlaSerAlaGlyPheProProAspGlnArgAsn                               465470475480                                                                   GGAGACAAGGAGTTTGTGATCCGCAGAGCAGCCACCAATCGTGTCTTG1488                           GlyAspLysGluPheValIleArgArgAlaAlaThrAsnArgValLeu                               485490495                                                                      AACGTGCTCCGCCACTGGGTGTCCAAGCACTCTCAGGACTTTGAGACC1536                           AsnValLeuArgHisTrpValSerLysHisSerGlnAspPheGluThr                               500505510                                                                      AACGATGAGCTCAAATGCAAGGTGATCGGCTTCCTGGAAGAAGTCATG1584                           AsnAspGluLeuLysCysLysValIleGlyPheLeuGluGluValMet                               515520525                                                                      CACGACCCGGAGCTCCTGACCCAGGAGCGGAAGGCTGCAGCCAACATC1632                           HisAspProGluLeuLeuThrGlnGluArgLysAlaAlaAlaAsnIle                               530535540                                                                      ATCAGGACTCTGACCCAGGAGGACCCAGGTGACAACCAGATCACGCTG1680                           IleArgThrLeuThrGlnGluAspProGlyAspAsnGlnIleThrLeu                               545550555560                                                                   GAGGAGATCACGCAGATGGCTGAAGGCGTGAAGGCTGAGCCCTTTGAA1728                           GluGluIleThrGlnMetAlaGluGlyValLysAlaGluProPheGlu                               565570575                                                                      AACCACTCAGCCCTGGAGATCGCGGAGCAGCTGACCCTGCTAGATCAC1776                           AsnHisSerAlaLeuGluIleAlaGluGlnLeuThrLeuLeuAspHis                               580585590                                                                      CTCGTCTTCAAGAAGATTCCTTATGAGGAGTTCTTCGGACAAGGATGG1824                           LeuValPheLysLysIleProTyrGluGluPhePheGlyGlnGlyTrp                               595600605                                                                      ATGAAACTGGAAAAGAATGAAAGGACCCCTTATATCATGAAAACCACT1872                           MetLysLeuGluLysAsnGluArgThrProTyrIleMetLysThrThr                               610615620                                                                      AAGCACTTCAATGACATCAGTAACTTGATTGCTTCAGAAATCATCCGC1920                           LysHisPheAsnAspIleSerAsnLeuIleAlaSerGluIleIleArg                               625630635640                                                                   AATGAGGACATCAACGCCAGGGTGAGCGCCATCGAGAAGTGGGTGGCC1968                           AsnGluAspIleAsnAlaArgValSerAlaIleGluLysTrpValAla                               645650655                                                                      GTAGCTGACATATGCCGCTGCCTCCACAACTACAATGCCGTACTGGAG2016                           ValAlaAspIleCysArgCysLeuHisAsnTyrAsnAlaValLeuGlu                               660665670                                                                      ATCACCTCGTCCATGAACCGCAGTGCAATCTTCCGGCTCAAAAAGACG2064                           IleThrSerSerMetAsnArgSerAlaIlePheArgLeuLysLysThr                               675680685                                                                      TGGCTCAAAGTCTCTAAGCAGACTAAAGCTTTGATTGATAAGCTCCAA2112                           TrpLeuLysValSerLysGlnThrLysAlaLeuIleAspLysLeuGln                               690695700                                                                      AAGCTTGTGTCATCTGAGGGCAGATTTAAGAATCTCAGAGAAGCTTTG2160                           LysLeuValSerSerGluGlyArgPheLysAsnLeuArgGluAlaLeu                               705710715720                                                                   AAAAATTGTGACCCACCCTGTGTCCCTTACCTGGGGATGTACCTCACC2208                           LysAsnCysAspProProCysValProTyrLeuGlyMetTyrLeuThr                               725730735                                                                      GACCTGGCCTTCATCGAGGAGGGGACGCCCAATTACACGGAAGACGGC2256                           AspLeuAlaPheIleGluGluGlyThrProAsnTyrThrGluAspGly                               740745750                                                                      CTGGTCAACTTCTCCAAGATGAGGATGATATCCCATATTATCCGAGAG2304                           LeuValAsnPheSerLysMetArgMetIleSerHisIleIleArgGlu                               755760765                                                                      ATTCGCCAGTTTCAACAAACTGCCTACAAAATAGAGCACCAAGCAAAG2352                           IleArgGlnPheGlnGlnThrAlaTyrLysIleGluHisGlnAlaLys                               770775780                                                                      GTAACGCAATATTTACTGGACCAATCTTTTGTAATGGATGAAGAAAGC2400                           ValThrGlnTyrLeuLeuAspGlnSerPheValMetAspGluGluSer                               785790795800                                                                   CTCTACGAGTCTTCTCTCCGAATAGAACCAAAACTCCCCACCTGAAGCTGTG2452                       LeuTyrGluSerSerLeuArgIleGluProLysLeuProThr                                     805810815                                                                      CCCAGCCCAGACCCAGCTGCTCCCGGGGACATGTGCTAGATGATACTGTACATATTCGTT2512               TGGTTTCACTGGATTTTCTTCTTCAGTATGTGCTTCTCCAAGAATACAAATCGTCCTTGT2572               TCTTAGATTCCTGTAGAACCGGAATATGAATTTCTGCACCGTTTCAGACTTCGCCCACCC2632               ATCCCTCCCCTCGCCCGAAT2652                                                       (2) INFORMATION FOR SEQ ID NO: 2:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 814 amino acids                                                    (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:                                       GlyAspGlyCysLysIleLeuLeuAspThrSerGlnThrPheValArg                               151015                                                                         GlnGlySerLeuIleGlnValProMetSerGluLysGlyLysIleThr                               202530                                                                         ArgGlyArgLeuGlySerLeuSerLeuLysLysGluGlyGluArgGln                               354045                                                                         CysPheLeuPheSerLysHisLeuIleIleCysThrArgGlySerGly                               505560                                                                         GlyLysLeuHisLeuThrLysAsnGlyValIleSerLeuIleAspCys                               65707580                                                                       ThrLeuLeuGluGluProGluSerThrGluGluGluAlaLysGlySer                               859095                                                                         GlyGlnAspIleAspHisLeuAspPheLysIleGlyValGluProLys                               100105110                                                                      AspSerProProPheThrValIleLeuValAlaSerSerArgGlnGlu                               115120125                                                                      LysAlaAlaTrpThrSerAspIleSerGlnCysValGlyAsnIleArg                               130135140                                                                      CysAsnGlyLeuMetMetLysProPheGluGluAsnSerLysValThr                               145150155160                                                                   ValProGlnMetIleLysSerAspAlaSerLeuTyrCysAspAspVal                               165170175                                                                      AspIleArgPheSerLysThrMetAsnSerCysLysValLeuGlnIle                               180185190                                                                      AlaTyrAlaSerValGluArgLeuLeuGluArgLeuThrAspLeuArg                               195200205                                                                      PheLeuSerIleAspPheLeuAsnThrPheLeuHisSerTyrArgVal                               210215220                                                                      PheThrThrAlaIleValValLeuAspLysLeuIleThrIleTyrLys                               225230235240                                                                   LysProIleSerAlaIleProAlaArgSerLeuGluLeuLeuPheAla                               245250255                                                                      SerGlyGlnAsnAsnLysLeuLeuTyrGlyGluProProLysSerPro                               260265270                                                                      ArgAlaThrArgLysPheSerSerProProProLeuSerIleThrLys                               275280285                                                                      ThrSerSerProSerArgArgArgLysLeuSerLeuAsnIleProIle                               290295300                                                                      IleThrGlyGlyLysAlaLeuAspLeuAlaAlaLeuSerCysAsnSer                               305310315320                                                                   AsnGlyTyrThrSerMetTyrSerAlaMetSerProPheSerLysAla                               325330335                                                                      ThrLeuAspThrSerLysLeuTyrValSerSerSerPheThrAsnLys                               340345350                                                                      IleProAspGluGlyAspThrThrProGluLysProGluAspProSer                               355360365                                                                      AlaLeuSerLysGlnSerSerGluValSerMetArgGluGluSerAsp                               370375380                                                                      IleAspGlnAsnGlnSerAspAspGlyAspThrGluThrSerProThr                               385390395400                                                                   LysSerProThrThrProLysSerValLysAsnLysAsnSerSerGlu                               405410415                                                                      PheProLeuPheSerTyrAsnAsnGlyValValMetThrSerCysArg                               420425430                                                                      GluLeuAspAsnAsnArgSerAlaLeuSerAlaAlaSerAlaPheAla                               435440445                                                                      IleAlaThrAlaGlyAlaAsnGluGlyThrProAsnLysGluLysTyr                               450455460                                                                      ArgArgMetSerLeuAlaSerAlaGlyPheProProAspGlnArgAsn                               465470475480                                                                   GlyAspLysGluPheValIleArgArgAlaAlaThrAsnArgValLeu                               485490495                                                                      AsnValLeuArgHisTrpValSerLysHisSerGlnAspPheGluThr                               500505510                                                                      AsnAspGluLeuLysCysLysValIleGlyPheLeuGluGluValMet                               515520525                                                                      HisAspProGluLeuLeuThrGlnGluArgLysAlaAlaAlaAsnIle                               530535540                                                                      IleArgThrLeuThrGlnGluAspProGlyAspAsnGlnIleThrLeu                               545550555560                                                                   GluGluIleThrGlnMetAlaGluGlyValLysAlaGluProPheGlu                               565570575                                                                      AsnHisSerAlaLeuGluIleAlaGluGlnLeuThrLeuLeuAspHis                               580585590                                                                      LeuValPheLysLysIleProTyrGluGluPhePheGlyGlnGlyTrp                               595600605                                                                      MetLysLeuGluLysAsnGluArgThrProTyrIleMetLysThrThr                               610615620                                                                      LysHisPheAsnAspIleSerAsnLeuIleAlaSerGluIleIleArg                               625630635640                                                                   AsnGluAspIleAsnAlaArgValSerAlaIleGluLysTrpValAla                               645650655                                                                      ValAlaAspIleCysArgCysLeuHisAsnTyrAsnAlaValLeuGlu                               660665670                                                                      IleThrSerSerMetAsnArgSerAlaIlePheArgLeuLysLysThr                               675680685                                                                      TrpLeuLysValSerLysGlnThrLysAlaLeuIleAspLysLeuGln                               690695700                                                                      LysLeuValSerSerGluGlyArgPheLysAsnLeuArgGluAlaLeu                               705710715720                                                                   LysAsnCysAspProProCysValProTyrLeuGlyMetTyrLeuThr                               725730735                                                                      AspLeuAlaPheIleGluGluGlyThrProAsnTyrThrGluAspGly                               740745750                                                                      LeuValAsnPheSerLysMetArgMetIleSerHisIleIleArgGlu                               755760765                                                                      IleArgGlnPheGlnGlnThrAlaTyrLysIleGluHisGlnAlaLys                               770775780                                                                      ValThrGlnTyrLeuLeuAspGlnSerPheValMetAspGluGluSer                               785790795800                                                                   LeuTyrGluSerSerLeuArgIleGluProLysLeuProThr                                     805810                                                                         (2) INFORMATION FOR SEQ ID NO: 3:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 666 amino acids                                                    (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:                                       MetMetLysProPheGluGluAsnSerLysValThrValProGlnMet                               151015                                                                         IleLysSerAspAlaSerLeuTyrCysAspAspValAspIleArgPhe                               202530                                                                         SerLysThrMetAsnSerCysLysValLeuGlnIleAlaTyrAlaSer                               354045                                                                         ValGluArgLeuLeuGluArgLeuThrAspLeuArgPheLeuSerIle                               505560                                                                         AspPheLeuAsnThrPheLeuHisSerTyrArgValPheThrThrAla                               65707580                                                                       IleValValLeuAspLysLeuIleThrIleTyrLysLysProIleSer                               859095                                                                         AlaIleProAlaArgSerLeuGluLeuLeuPheAlaSerGlyGlnAsn                               100105110                                                                      AsnLysLeuLeuTyrGlyGluProProLysSerProArgAlaThrArg                               115120125                                                                      LysPheSerSerProProProLeuSerIleThrLysThrSerSerPro                               130135140                                                                      SerArgArgArgLysLeuSerLeuAsnIleProIleIleThrGlyGly                               145150155160                                                                   LysAlaLeuAspLeuAlaAlaLeuSerCysAsnSerAsnGlyTyrThr                               165170175                                                                      SerMetTyrSerAlaMetSerProPheSerLysAlaThrLeuAspThr                               180185190                                                                      SerLysLeuTyrValSerSerSerPheThrAsnLysIleProAspGlu                               195200205                                                                      GlyAspThrThrProGluLysProGluAspProSerAlaLeuSerLys                               210215220                                                                      GlnSerSerGluValSerMetArgGluGluSerAspIleAspGlnAsn                               225230235240                                                                   GlnSerAspAspGlyAspThrGluThrSerProThrLysSerProThr                               245250255                                                                      ThrProLysSerValLysAsnLysAsnSerSerGluPheProLeuPhe                               260265270                                                                      SerTyrAsnAsnGlyValValMetThrSerCysArgGluLeuAspAsn                               275280285                                                                      AsnArgSerAlaLeuSerAlaAlaSerAlaPheAlaIleAlaThrAla                               290295300                                                                      GlyAlaAsnGluGlyThrProAsnLysGluLysTyrArgArgMetSer                               305310315320                                                                   LeuAlaSerAlaGlyPheProProAspGlnArgAsnGlyAspLysGlu                               325330335                                                                      PheValIleArgArgAlaAlaThrAsnArgValLeuAsnValLeuArg                               340345350                                                                      HisTrpValSerLysHisSerGlnAspPheGluThrAsnAspGluLeu                               355360365                                                                      LysCysLysValIleGlyPheLeuGluGluValMetHisAspProGlu                               370375380                                                                      LeuLeuThrGlnGluArgLysAlaAlaAlaAsnIleIleArgThrLeu                               385390395400                                                                   ThrGlnGluAspProGlyAspAsnGlnIleThrLeuGluGluIleThr                               405410415                                                                      GlnMetAlaGluGlyValLysAlaGluProPheGluAsnHisSerAla                               420425430                                                                      LeuGluIleAlaGluGlnLeuThrLeuLeuAspHisLeuValPheLys                               435440445                                                                      LysIleProTyrGluGluPhePheGlyGlnGlyTrpMetLysLeuGlu                               450455460                                                                      LysAsnGluArgThrProTyrIleMetLysThrThrLysHisPheAsn                               465470475480                                                                   AspIleSerAsnLeuIleAlaSerGluIleIleArgAsnGluAspIle                               485490495                                                                      AsnAlaArgValSerAlaIleGluLysTrpValAlaValAlaAspIle                               500505510                                                                      CysArgCysLeuHisAsnTyrAsnAlaValLeuGluIleThrSerSer                               515520525                                                                      MetAsnArgSerAlaIlePheArgLeuLysLysThrTrpLeuLysVal                               530535540                                                                      SerLysGlnThrLysAlaLeuIleAspLysLeuGlnLysLeuValSer                               545550555560                                                                   SerGluGlyArgPheLysAsnLeuArgGluAlaLeuLysAsnCysAsp                               565570575                                                                      ProProCysValProTyrLeuGlyMetTyrLeuThrAspLeuAlaPhe                               580585590                                                                      IleGluGluGlyThrProAsnTyrThrGluAspGlyLeuValAsnPhe                               595600605                                                                      SerLysMetArgMetIleSerHisIleIleArgGluIleArgGlnPhe                               610615620                                                                      GlnGlnThrAlaTyrLysIleGluHisGlnAlaLysValThrGlnTyr                               625630635640                                                                   LeuLeuAspGlnSerPheValMetAspGluGluSerLeuTyrGluSer                               645650655                                                                      SerLeuArgIleGluProLysLeuProThr                                                 660665                                                                         (2) INFORMATION FOR SEQ ID NO: 4:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 489 amino acids                                                    (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:                                       MetTyrSerAlaMetSerProPheSerLysAlaThrLeuAspThrSer                               151015                                                                         LysLeuTyrValSerSerSerPheThrAsnLysIleProAspGluGly                               202530                                                                         AspThrThrProGluLysProGluAspProSerAlaLeuSerLysGln                               354045                                                                         SerSerGluValSerMetArgGluGluSerAspIleAspGlnAsnGln                               505560                                                                         SerAspAspGlyAspThrGluThrSerProThrLysSerProThrThr                               65707580                                                                       ProLysSerValLysAsnLysAsnSerSerGluPheProLeuPheSer                               859095                                                                         TyrAsnAsnGlyValValMetThrSerCysArgGluLeuAspAsnAsn                               100105110                                                                      ArgSerAlaLeuSerAlaAlaSerAlaPheAlaIleAlaThrAlaGly                               115120125                                                                      AlaAsnGluGlyThrProAsnLysGluLysTyrArgArgMetSerLeu                               130135140                                                                      AlaSerAlaGlyPheProProAspGlnArgAsnGlyAspLysGluPhe                               145150155160                                                                   ValIleArgArgAlaAlaThrAsnArgValLeuAsnValLeuArgHis                               165170175                                                                      TrpValSerLysHisSerGlnAspPheGluThrAsnAspGluLeuLys                               180185190                                                                      CysLysValIleGlyPheLeuGluGluValMetHisAspProGluLeu                               195200205                                                                      LeuThrGlnGluArgLysAlaAlaAlaAsnIleIleArgThrLeuThr                               210215220                                                                      GlnGluAspProGlyAspAsnGlnIleThrLeuGluGluIleThrGln                               225230235240                                                                   MetAlaGluGlyValLysAlaGluProPheGluAsnHisSerAlaLeu                               245250255                                                                      GluIleAlaGluGlnLeuThrLeuLeuAspHisLeuValPheLysLys                               260265270                                                                      IleProTyrGluGluPhePheGlyGlnGlyTrpMetLysLeuGluLys                               275280285                                                                      AsnGluArgThrProTyrIleMetLysThrThrLysHisPheAsnAsp                               290295300                                                                      IleSerAsnLeuIleAlaSerGluIleIleArgAsnGluAspIleAsn                               305310315320                                                                   AlaArgValSerAlaIleGluLysTrpValAlaValAlaAspIleCys                               325330335                                                                      ArgCysLeuHisAsnTyrAsnAlaValLeuGluIleThrSerSerMet                               340345350                                                                      AsnArgSerAlaIlePheArgLeuLysLysThrTrpLeuLysValSer                               355360365                                                                      LysGlnThrLysAlaLeuIleAspLysLeuGlnLysLeuValSerSer                               370375380                                                                      GluGlyArgPheLysAsnLeuArgGluAlaLeuLysAsnCysAspPro                               385390395400                                                                   ProCysValProTyrLeuGlyMetTyrLeuThrAspLeuAlaPheIle                               405410415                                                                      GluGluGlyThrProAsnTyrThrGluAspGlyLeuValAsnPheSer                               420425430                                                                      LysMetArgMetIleSerHisIleIleArgGluIleArgGlnPheGln                               435440445                                                                      GlnThrAlaTyrLysIleGluHisGlnAlaLysValThrGlnTyrLeu                               450455460                                                                      LeuAspGlnSerPheValMetAspGluGluSerLeuTyrGluSerSer                               465470475480                                                                   LeuArgIleGluProLysLeuProThr                                                    485                                                                            (2) INFORMATION FOR SEQ ID NO: 5:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 1092 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (iii) HYPOTHETICAL: NO                                                         (iii) ANTI-SENSE: NO                                                           (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 1..1092                                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:                                       ATTACTAAAATAATCCAAAGGAAAAAAATTGCAAGAGACAATGGACCA48                             IleThrLysIleIleGlnArgLysLysIleAlaArgAspAsnGlyPro                               151015                                                                         GGTCATAATATTACATTTCAGAGTTCACCTCCCACAGTTGAGTGGCAT96                             GlyHisAsnIleThrPheGlnSerSerProProThrValGluTrpHis                               202530                                                                         ATAAGCAGACCTGGGCACATAGAGACTTTTGACCTGCTCACCTTACAC144                            IleSerArgProGlyHisIleGluThrPheAspLeuLeuThrLeuHis                               354045                                                                         CCAATAGAAATTGCTCGACAACTCACTTTACTTGATTCAGATCTATAC192                            ProIleGluIleAlaArgGlnLeuThrLeuLeuAspSerAspLeuTyr                               505560                                                                         CGAGCTGTACAGCCATCAGATTTAGTTGGAAGTGTGTGGACAAAAGAA240                            ArgAlaValGlnProSerAspLeuValGlySerValTrpThrLysGlu                               65707580                                                                       GACAAAGAAATTAACTCTCCTAATCTTCTGAAAATGATTCGACATACC288                            AspLysGluIleAsnSerProAsnLeuLeuLysMetIleArgHisThr                               859095                                                                         ACCAACCTCACTCTGTGGTTTGAGAAATGTATTGTAGAAACTGAAAAT336                            ThrAsnLeuThrLeuTrpPheGluLysCysIleValGluThrGluAsn                               100105110                                                                      TTAGAAGAAAGAGTAGCTGTGGTGAGTCGAATTATTGAGATTCTACAA384                            LeuGluGluArgValAlaValValSerArgIleIleGluIleLeuGln                               115120125                                                                      GTCTTTCAAGAGTTGAACAACTTTAATGGGGTCCTTGAGGTTGTCAGT432                            ValPheGlnGluLeuAsnAsnPheAsnGlyValLeuGluValValSer                               130135140                                                                      GCTATGAATTCCTCACCTGTTTACAGACTAGACCACACATTTGAGCAA480                            AlaMetAsnSerSerProValTyrArgLeuAspHisThrPheGluGln                               145150155160                                                                   ATACCAAGTCGCCAGAAGAAAATTTTAGAAGAAGCTCATGAATTGAGT528                            IleProSerArgGlnLysLysIleLeuGluGluAlaHisGluLeuSer                               165170175                                                                      GAAGATCACTATAAGAAATATTTGGCAAAACTCAGGTCTATTAATCCA576                            GluAspHisTyrLysLysTyrLeuAlaLysLeuArgSerIleAsnPro                               180185190                                                                      CCATGTGTGCCTTTCTTTGGAATTTATCTACATAATATCTTGAAAACA624                            ProCysValProPhePheGlyIleTyrLeuHisAsnIleLeuLysThr                               195200205                                                                      GAAGAAGGCAACCCTGAGGTCCTAAAAAGACATGGAAAAGAGCTTATA672                            GluGluGlyAsnProGluValLeuLysArgHisGlyLysGluLeuIle                               210215220                                                                      AACTTTAGCAAAAGGAGGAAAGTAGCAGAAATAACAGGAGAGATCCAG720                            AsnPheSerLysArgArgLysValAlaGluIleThrGlyGluIleGln                               225230235240                                                                   CAGTACCAAAATCAGCCTTACTGTTTACGAGTAGAATCAGATATCAAA768                            GlnTyrGlnAsnGlnProTyrCysLeuArgValGluSerAspIleLys                               245250255                                                                      AGGTTCTTTGAAAACTTGAATCCGATGGGAAATAGCATGGAGAGGGAA816                            ArgPhePheGluAsnLeuAsnProMetGlyAsnSerMetGluArgGlu                               260265270                                                                      TTTACAGATTATCTTTTCAACAAATCCCTAGAAATAGAACCACGAAAC864                            PheThrAspTyrLeuPheAsnLysSerLeuGluIleGluProArgAsn                               275280285                                                                      CCTAAGCCTCTCCCAAGATTTCCAAAAAAATATACGTATCCCCTAAAA912                            ProLysProLeuProArgPheProLysLysTyrThrTyrProLeuLys                               290295300                                                                      TCTCCTGGTGTCCGGCCATCAAACCCAAGACCGGGTACCATGAGGATC960                            SerProGlyValArgProSerAsnProArgProGlyThrMetArgIle                               305310315320                                                                   CCCACCCCTCTACAGCAGGAACCACGAAAAATAAGTTATAGTAGAATA1008                           ProThrProLeuGlnGlnGluProArgLysIleSerTyrSerArgIle                               325330335                                                                      CCAGAGTCAGAGACAGAGAGTACTGCTAGTGCACCTAATTCACCAAGG1056                           ProGluSerGluThrGluSerThrAlaSerAlaProAsnSerProArg                               340345350                                                                      ACACCTCTAACACCTCCACCTGCATCAGGAACATCA1092                                       ThrProLeuThrProProProAlaSerGlyThrSer                                           355360                                                                         (2) INFORMATION FOR SEQ ID NO: 6:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 364 amino acids                                                    (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:                                       IleThrLysIleIleGlnArgLysLysIleAlaArgAspAsnGlyPro                               151015                                                                         GlyHisAsnIleThrPheGlnSerSerProProThrValGluTrpHis                               202530                                                                         IleSerArgProGlyHisIleGluThrPheAspLeuLeuThrLeuHis                               354045                                                                         ProIleGluIleAlaArgGlnLeuThrLeuLeuAspSerAspLeuTyr                               505560                                                                         ArgAlaValGlnProSerAspLeuValGlySerValTrpThrLysGlu                               65707580                                                                       AspLysGluIleAsnSerProAsnLeuLeuLysMetIleArgHisThr                               859095                                                                         ThrAsnLeuThrLeuTrpPheGluLysCysIleValGluThrGluAsn                               100105110                                                                      LeuGluGluArgValAlaValValSerArgIleIleGluIleLeuGln                               115120125                                                                      ValPheGlnGluLeuAsnAsnPheAsnGlyValLeuGluValValSer                               130135140                                                                      AlaMetAsnSerSerProValTyrArgLeuAspHisThrPheGluGln                               145150155160                                                                   IleProSerArgGlnLysLysIleLeuGluGluAlaHisGluLeuSer                               165170175                                                                      GluAspHisTyrLysLysTyrLeuAlaLysLeuArgSerIleAsnPro                               180185190                                                                      ProCysValProPhePheGlyIleTyrLeuHisAsnIleLeuLysThr                               195200205                                                                      GluGluGlyAsnProGluValLeuLysArgHisGlyLysGluLeuIle                               210215220                                                                      AsnPheSerLysArgArgLysValAlaGluIleThrGlyGluIleGln                               225230235240                                                                   GlnTyrGlnAsnGlnProTyrCysLeuArgValGluSerAspIleLys                               245250255                                                                      ArgPhePheGluAsnLeuAsnProMetGlyAsnSerMetGluArgGlu                               260265270                                                                      PheThrAspTyrLeuPheAsnLysSerLeuGluIleGluProArgAsn                               275280285                                                                      ProLysProLeuProArgPheProLysLysTyrThrTyrProLeuLys                               290295300                                                                      SerProGlyValArgProSerAsnProArgProGlyThrMetArgIle                               305310315320                                                                   ProThrProLeuGlnGlnGluProArgLysIleSerTyrSerArgIle                               325330335                                                                      ProGluSerGluThrGluSerThrAlaSerAlaProAsnSerProArg                               340345350                                                                      ThrProLeuThrProProProAlaSerGlyThrSer                                           355360                                                                         (2) INFORMATION FOR SEQ ID NO: 7:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 1956 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (iii) HYPOTHETICAL: NO                                                         (iii) ANTI-SENSE: NO                                                           (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 1..1956                                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:                                       AGGTTTGAAATCCCAGAGCCAGAACCTACAGAAGCAGATAAACTAGCA48                             ArgPheGluIleProGluProGluProThrGluAlaAspLysLeuAla                               151015                                                                         CTTGAGAAAGGAGAACAACCAATCTCTGCAGATCTAAAGAGGTTCAGA96                             LeuGluLysGlyGluGlnProIleSerAlaAspLeuLysArgPheArg                               202530                                                                         AAGGAATATATCCAACCAGTACAGCTACGGGTGTTGAACGTGCAGCGG144                            LysGluTyrIleGlnProValGlnLeuArgValLeuAsnValGlnArg                               354045                                                                         CACTGGGTTGAACATCACCCCCATGACTTTGAAAGAGACTTGGAACTG192                            HisTrpValGluHisHisProHisAspPheGluArgAspLeuGluLeu                               505560                                                                         CTCGAAAGACTAGAATCCTTCACCTCAAGCGCTCACAGAGCGAAAGCA240                            LeuGluArgLeuGluSerPheThrSerSerAlaHisArgAlaLysAla                               65707580                                                                       ATGAAGAAGTGGGTAGAGAGCATCGCTAAGACCATCAGGAGGAAGAAG288                            MetLysLysTrpValGluSerIleAlaLysThrIleArgArgLysLys                               859095                                                                         CAAGCTCAGGCAAATGGAGTAAGCCATAATATTACCTTTGAAAGTCCA336                            GlnAlaGlnAlaAsnGlyValSerHisAsnIleThrPheGluSerPro                               100105110                                                                      CCTCCACCAATTGAATGGCATATCAGCAAACCAGGACAGTTTGAAACA384                            ProProProIleGluTrpHisIleSerLysProGlyGlnPheGluThr                               115120125                                                                      TTTGATCTCATGACACTTCATCCAATAGAAATTGCACGTCAGCTGACA432                            PheAspLeuMetThrLeuHisProIleGluIleAlaArgGlnLeuThr                               130135140                                                                      CTTTTGGAGTCTGATCTTTACAGGAAAGTTCAACCGTCTGAACTTGTA480                            LeuLeuGluSerAspLeuTyrArgLysValGlnProSerGluLeuVal                               145150155160                                                                   GGGAGTGTGTGGACCAAAGAAGATAAAGAAATAAATTCTCCAAATTTA528                            GlySerValTrpThrLysGluAspLysGluIleAsnSerProAsnLeu                               165170175                                                                      TTAAAAATGATTCGCCATACCACAAATCTCACCCTCTGGTTTGAAAAA576                            LeuLysMetIleArgHisThrThrAsnLeuThrLeuTrpPheGluLys                               180185190                                                                      TGCATTGTGGAAGCAGAAAATTTTGAAGAACGGGTGGCAGTACTAAGT624                            CysIleValGluAlaGluAsnPheGluGluArgValAlaValLeuSer                               195200205                                                                      AGAATTATAGAAATTCTGCAAGTTTTTCGAGATTTGAATAATTTCAAT672                            ArgIleIleGluIleLeuGlnValPheArgAspLeuAsnAsnPheAsn                               210215220                                                                      GGCGTATTGGAGATAGTCAGTGCAGTAAATTCAGTGTCAGTATACAGA720                            GlyValLeuGluIleValSerAlaValAsnSerValSerValTyrArg                               225230235240                                                                   CTAGACCATACCTTTGAGGCACTGCAGGAAAGGAAAAGGAAAATTTTG768                            LeuAspHisThrPheGluAlaLeuGlnGluArgLysArgLysIleLeu                               245250255                                                                      GACGAAGCTGTGGAATTAAGTCAAGATCACTTTAAAAAATACCTAGTA816                            AspGluAlaValGluLeuSerGlnAspHisPheLysLysTyrLeuVal                               260265270                                                                      AAACTTAAGTCAATCAATCCACCTTGTGTGCCTTTTTTTGGAATATAT864                            LysLeuLysSerIleAsnProProCysValProPhePheGlyIleTyr                               275280285                                                                      TTAACAAATATTCTGAAGACCGAAGAAGGGAATAATGATTTTTTAAAA912                            LeuThrAsnIleLeuLysThrGluGluGlyAsnAsnAspPheLeuLys                               290295300                                                                      AAGAAAGGGAAAGATTTAATCAATTTCAGTAAGAGGAGGAAAGTAGCT960                            LysLysGlyLysAspLeuIleAsnPheSerLysArgArgLysValAla                               305310315320                                                                   GAAATTACTGGAGAAATTCAGCAGTATCAGAATCAGCCGTACTGCCTA1008                           GluIleThrGlyGluIleGlnGlnTyrGlnAsnGlnProTyrCysLeu                               325330335                                                                      CGGATAGAACCAGATATGAGGAGATTCTTTGAAAACCTTAACCCCATG1056                           ArgIleGluProAspMetArgArgPhePheGluAsnLeuAsnProMet                               340345350                                                                      GGAAGTGCATGTGAAAAAGAGTTTACAGATTATTTGTTCAACAAAAGT1104                           GlySerAlaCysGluLysGluPheThrAspTyrLeuPheAsnLysSer                               355360365                                                                      TTAGAAATAGAACCACGAAATTGTAAACAGCCACCACGATTTCCACGA1152                           LeuGluIleGluProArgAsnCysLysGlnProProArgPheProArg                               370375380                                                                      AAAAGTACATTTGAACTAAAAGAACCAGGAATACGACCAAATGCAGGA1200                           LysSerThrPheGluLeuLysGluProGlyIleArgProAsnAlaGly                               385390395400                                                                   CGACATGGAGAAACAAGTGGAACAAGAGGACATCCAACACCTCTAGAA1248                           ArgHisGlyGluThrSerGlyThrArgGlyHisProThrProLeuGlu                               405410415                                                                      AGAGAACCATATAAAATAGAATTTGAAAGAATAGCTGAAACAGAACTA1296                           ArgGluProTyrLysIleGluPheGluArgIleAlaGluThrGluLeu                               420425430                                                                      GAAAGTACAGTAAGTGCACCAACAAGTCCAAATACTCCCTCAACACCA1344                           GluSerThrValSerAlaProThrSerProAsnThrProSerThrPro                               435440445                                                                      CCAGTTTCAGCATCATCAGATCACTCAGTATTTTTAGATGTAGATCTA1392                           ProValSerAlaSerSerAspHisSerValPheLeuAspValAspLeu                               450455460                                                                      AACAGTAGTCACGGATCAAACACAATCTTTGCACCTGTGCTACTACCA1440                           AsnSerSerHisGlySerAsnThrIlePheAlaProValLeuLeuPro                               465470475480                                                                   AAGTCCAAGTCTTTCTTTAGTTCATGTGGAAGTTTACATAAACTAAGT1488                           LysSerLysSerPhePheSerSerCysGlySerLeuHisLysLeuSer                               485490495                                                                      GAAGAGCCCCTGATTCCTCCTCCTCTTCCTCCTCGAAAAAAGTTTGAT1536                           GluGluProLeuIleProProProLeuProProArgLysLysPheAsp                               500505510                                                                      CATGATGGCTCAAATTCCAAGGGAAATATGAAATCTGATGATGATCCT1584                           HisAspGlySerAsnSerLysGlyAsnMetLysSerAspAspAspPro                               515520525                                                                      CCTGCTATTCCACCGAGACAGCCTCCTCCTCCAAAGGTAAAACCCAGA1632                           ProAlaIleProProArgGlnProProProProLysValLysProArg                               530535540                                                                      GTTCCTGTTCCTACTGGTGCATTTGATGGGCCTCTGCATAGTCCACCT1680                           ValProValProThrGlyAlaPheAspGlyProLeuHisSerProPro                               545550555560                                                                   CCGCCACCACCAAGAGATCCTCTTCCTGATACCCCTCCACCAGTTCCC1728                           ProProProProArgAspProLeuProAspThrProProProValPro                               565570575                                                                      CTTCGGCCTCCAGAACACTTTATAAACTGTCCATTTAATCTTCAGCCA1776                           LeuArgProProGluHisPheIleAsnCysProPheAsnLeuGlnPro                               580585590                                                                      CCTCCACTGGGGCATCTTCACAGAGATTCAGACTGGCTCAGAGACATT1824                           ProProLeuGlyHisLeuHisArgAspSerAspTrpLeuArgAspIle                               595600605                                                                      AGTACGTGTCCAAATTCGCCAAGCACTCCTCCTAGCACACCCTCTCCA1872                           SerThrCysProAsnSerProSerThrProProSerThrProSerPro                               610615620                                                                      AGGGTACCGCGTCGATGCTATGTGCTCAGTTCTAGTCAGAATAATCTT1920                           ArgValProArgArgCysTyrValLeuSerSerSerGlnAsnAsnLeu                               625630635640                                                                   GCTCATCCTCCAGCTCCCCCTGTTCCACCAAGGGAG1956                                       AlaHisProProAlaProProValProProArgGlu                                           645650                                                                         (2) INFORMATION FOR SEQ ID NO: 8:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 652 amino acids                                                    (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:                                       ArgPheGluIleProGluProGluProThrGluAlaAspLysLeuAla                               151015                                                                         LeuGluLysGlyGluGlnProIleSerAlaAspLeuLysArgPheArg                               202530                                                                         LysGluTyrIleGlnProValGlnLeuArgValLeuAsnValGlnArg                               354045                                                                         HisTrpValGluHisHisProHisAspPheGluArgAspLeuGluLeu                               505560                                                                         LeuGluArgLeuGluSerPheThrSerSerAlaHisArgAlaLysAla                               65707580                                                                       MetLysLysTrpValGluSerIleAlaLysThrIleArgArgLysLys                               859095                                                                         GlnAlaGlnAlaAsnGlyValSerHisAsnIleThrPheGluSerPro                               100105110                                                                      ProProProIleGluTrpHisIleSerLysProGlyGlnPheGluThr                               115120125                                                                      PheAspLeuMetThrLeuHisProIleGluIleAlaArgGlnLeuThr                               130135140                                                                      LeuLeuGluSerAspLeuTyrArgLysValGlnProSerGluLeuVal                               145150155160                                                                   GlySerValTrpThrLysGluAspLysGluIleAsnSerProAsnLeu                               165170175                                                                      LeuLysMetIleArgHisThrThrAsnLeuThrLeuTrpPheGluLys                               180185190                                                                      CysIleValGluAlaGluAsnPheGluGluArgValAlaValLeuSer                               195200205                                                                      ArgIleIleGluIleLeuGlnValPheArgAspLeuAsnAsnPheAsn                               210215220                                                                      GlyValLeuGluIleValSerAlaValAsnSerValSerValTyrArg                               225230235240                                                                   LeuAspHisThrPheGluAlaLeuGlnGluArgLysArgLysIleLeu                               245250255                                                                      AspGluAlaValGluLeuSerGlnAspHisPheLysLysTyrLeuVal                               260265270                                                                      LysLeuLysSerIleAsnProProCysValProPhePheGlyIleTyr                               275280285                                                                      LeuThrAsnIleLeuLysThrGluGluGlyAsnAsnAspPheLeuLys                               290295300                                                                      LysLysGlyLysAspLeuIleAsnPheSerLysArgArgLysValAla                               305310315320                                                                   GluIleThrGlyGluIleGlnGlnTyrGlnAsnGlnProTyrCysLeu                               325330335                                                                      ArgIleGluProAspMetArgArgPhePheGluAsnLeuAsnProMet                               340345350                                                                      GlySerAlaCysGluLysGluPheThrAspTyrLeuPheAsnLysSer                               355360365                                                                      LeuGluIleGluProArgAsnCysLysGlnProProArgPheProArg                               370375380                                                                      LysSerThrPheGluLeuLysGluProGlyIleArgProAsnAlaGly                               385390395400                                                                   ArgHisGlyGluThrSerGlyThrArgGlyHisProThrProLeuGlu                               405410415                                                                      ArgGluProTyrLysIleGluPheGluArgIleAlaGluThrGluLeu                               420425430                                                                      GluSerThrValSerAlaProThrSerProAsnThrProSerThrPro                               435440445                                                                      ProValSerAlaSerSerAspHisSerValPheLeuAspValAspLeu                               450455460                                                                      AsnSerSerHisGlySerAsnThrIlePheAlaProValLeuLeuPro                               465470475480                                                                   LysSerLysSerPhePheSerSerCysGlySerLeuHisLysLeuSer                               485490495                                                                      GluGluProLeuIleProProProLeuProProArgLysLysPheAsp                               500505510                                                                      HisAspGlySerAsnSerLysGlyAsnMetLysSerAspAspAspPro                               515520525                                                                      ProAlaIleProProArgGlnProProProProLysValLysProArg                               530535540                                                                      ValProValProThrGlyAlaPheAspGlyProLeuHisSerProPro                               545550555560                                                                   ProProProProArgAspProLeuProAspThrProProProValPro                               565570575                                                                      LeuArgProProGluHisPheIleAsnCysProPheAsnLeuGlnPro                               580585590                                                                      ProProLeuGlyHisLeuHisArgAspSerAspTrpLeuArgAspIle                               595600605                                                                      SerThrCysProAsnSerProSerThrProProSerThrProSerPro                               610615620                                                                      ArgValProArgArgCysTyrValLeuSerSerSerGlnAsnAsnLeu                               625630635640                                                                   AlaHisProProAlaProProValProProArgGlu                                           645650                                                                         (2) INFORMATION FOR SEQ ID NO:9:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 41 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: other nucleic acid                                         (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (ix) FEATURE:                                                                  (A) NAME/KEY: modified.sub.-- base                                             (B) LOCATION: 15                                                               (D) OTHER INFORMATION: /mod.sub.-- base=i                                      (ix) FEATURE:                                                                  (A) NAME/KEY: modified.sub.-- base                                             (B) LOCATION: 18                                                               (D) OTHER INFORMATION: /mod.sub.-- base=i                                      (ix) FEATURE:                                                                  (A) NAME/KEY: modified.sub.-- base                                             (B) LOCATION: 21                                                               (D) OTHER INFORMATION: /mod.sub.-- base=i                                      (ix) FEATURE:                                                                  (A) NAME/KEY: modified.sub.-- base                                             (B) LOCATION: 27                                                               (D) OTHER INFORMATION: /mod.sub.-- base=i                                      (ix) FEATURE:                                                                  (A) NAME/KEY: modified.sub.-- base                                             (B) LOCATION: 30                                                               (D) OTHER INFORMATION: /mod.sub.-- base=i                                      (ix) FEATURE:                                                                  (A) NAME/KEY: modified.sub.-- base                                             (B) LOCATION: 33                                                               (D) OTHER INFORMATION: /mod.sub.-- base=i                                      (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                                        GATATCGAATTCCGNGTNYTNAAYGTNYTNMGNCAYTGGGT41                                    (2) INFORMATION FOR SEQ ID NO:10:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 34 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: other nucleic acid                                         (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (ix) FEATURE:                                                                  (A) NAME/KEY: modified.sub.-- base                                             (B) LOCATION: 15                                                               (D) OTHER INFORMATION: /mod.sub.-- base=i                                      (ix) FEATURE:                                                                  (A) NAME/KEY: modified.sub.-- base                                             (B) LOCATION: 21                                                               (D) OTHER INFORMATION: /mod.sub.-- base=i                                      (ix) FEATURE:                                                                  (A) NAME/KEY: modified.sub.-- base                                             (B) LOCATION: 30                                                               (D) OTHER INFORMATION: /mod.sub.-- base=i                                      (ix) FEATURE:                                                                  (A) NAME/KEY: modified.sub.-- base                                             (B) LOCATION: 33                                                               (D) OTHER INFORMATION: /mod.sub.-- base=i                                      (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:                                       AAGCTTGAATTCCKNMKYTTNSWRAARTTNAKNA34                                           (2) INFORMATION FOR SEQ ID NO: 11:                                             (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 33 bases                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: simple                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: other nucleic acid                                         (iii) HYPOTHETICAL: NO                                                         (iii) ANTI-SENSE: NO                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:                                      ATCHGTNAGGTACATNCCHAGGTAKGGNACACA33                                            (2) INFORMATION FOR SEQ ID NO: 12:                                             (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 27 bases                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: simple                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: other nucleic acid                                         (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:                                      GCNATHTTYCGNCTNAARAARACNTGG27                                                  __________________________________________________________________________ 

We claim:
 1. An isolated peptide selected from the group consisting of:(a) SEQ ID no. 2, 3, and 4, and (b) fragments of (a), wherein said peptide is capable of stimulating the levels of exchange of GDP with p21-GDP complexes.
 2. An isolated nucleic acid encoding a peptide according to claim
 1. 3. A nucleic acid according to claim 2, having a sequence selected from the group consisting of:(a) all or part of nucleotide 1-2442 of SEQ ID no. 1, and (b) sequences derived from (a) as a result of the degeneracy of the genetic code.
 4. A method or preparing a peptide comprising culturing a cell containing a nucleic acid according to claim 2 under conditions for expression of said sequence, and recovering the produced peptide.
 5. A composition comprising a nucleic acid according to claim 2 and a pharmaceutically acceptable carrier.
 6. A vector comprising the nucleic acid of claim
 2. 7. An isolated comprising the nucleic acid of claim
 2. 8. A nucleic acid according to claim 2 having the sequence of SEQ ID no.
 1. 9. A composition comprising a peptide according to claim 1 and a pharmaceutically acceptable carrier. 